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Abstract 

We characterise the class of distributions of random stochastic matrices X 
with the property that the products X(n)X(n — 1) • • • X(l) of i.i.d. copies 
X(k) of X converge a.s. as n — > oo and the limit is Dirichlet distributed. This 
extends a result by Chamayou and Letac (1994) and is illustrated by several 
examples that are of interest in applications. 
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1 Introduction 

Let X be a random d x d matrix with non- negative entries, and {X(n)} n >i be an 
i.i.d. sequence of random matrices with the same distribution as X. In this paper, 
we study the limit of the left products 

X(n,l) := X(n)X(n-l)---X(l) (1) 

as n — » oo for a certain class of random matrices X. Clearly, the distribution of ([I]) 
is equal to that of right product 

X(l,n):=X{l)X{2)---X(n), (2) 

and therefore any assertions concerning the distribution of (pQ) as n — > oo apply to 
that of ([2]) as well. 

The asymptotic behaviour of the left products ([T]) was apparently first studied by 
Bellman [2], who showed, under certain conditions, that Hindoo n _1 E \ogX(n, l)jj 
exists, where X(n, is the entry in the ith row and jth column of X(n, 1). This 
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result was later strengthened to almost sure convergence in [9] , where the behaviour 
of the norms ||AT(n, 1)|| := maxj< d J^ =1 |X(n, was studied. 

Several equivalent conditions for convergence in distribution of the right products 
([2]) without normalisation were given in [12]. See also [3] and [TS] for surveys on the 
properties of the products §1§ and (j2J). 

We consider the case when the random matrix X is stochastic, i.e., where all 
the row sums of X equal one. The study of the products (JTJ in this case was 
apparently initiated by Rosenblatt in |21j . who applied existing results for compact 
semigroups. Recent results on the infinite product of deterministic and random 
stochastic matrices can be found in [23] . 

The class of stochastic matrices is a semigroup under matrix multiplication, 
and therefore the products ([I]) and ([2]) generate a left and a right random walk on 
stochastic matrices by 

X(n,l) (3) 

and 

n h> X(l,n), (4) 

respectively. 

The right random walk (T4]) is central to several related problems involving dis- 
tributed averaging. These include distributed computation, distributed optimiza- 
tion, distributed estimation, and distributed coordination (see [23] and references 
therein). They are also related to certain Markov processes called Potlatch models. 
These models were introduced in [TTJ and [16], and are described in their simplest 
form in [T2] . We will discuss these models further in Section 14.11 A comprehen- 
sive reference for random walks on stochastic matrices and more general semigroups 
is [TU], where the transient random walk generated by random stochastic matrices 
satisfying conditions [I]— [III] below is presented in Appendix B. 

In [6], Chamayou and Letac study the left products (JTJ for random stochastic 
matrices X satisfying the following conditions: 

[I] The rows of X are independent. 
[II] The rows of X are Dirichlet distributed. 

[Ill] Letting (ct^i, . . . , a^) be the Dirichlet parameters of the ith row of X, we have 
J2 d j=1 onj = J2 d j=i otj,i for i = 1, . . . , d. 

It is shown in [6] that the above conditions are sufficient to ensure that: 

[Al] The products X(n, 1) converge a.s. to some random matrix Iasn4 oo. 
[A2] The limit X has identical rows a.s. 
[A3] The rows of X are Dirichlet distributed. 
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Unfortunately, conditions [I]- [III] are quite restrictive. Surprisingly, it turns out 
assertions [A1]-[A3] remain true under much broader conditions, and that none of 
the conditions [I]— [III] is necessary. 

Denote by the class of all distributions of a random d x d stochastic matrix 
X such that [A1]-[A3] hold. In this note, we extend the result in [6] by providing 
a charaterisation theorem (Theorem HJ) for the class Jfa- We find that J^l is much 
broader than the class of distributions satisfying [I]— [III]. In addition, once the 
Dirichlet parameters of the limiting distribution are known, it is elementary to verify 
that a given distribution belongs to the class J^. 

Chamayou and Letac's result above relies on a theorem (which we present as 
Theorem [1] below) that admits a rather elegant proof suggested by S. Lauritzen 
and also presented in [6]. The authors of [6] pointed out, using Erdos's remark on 
the existence of God's book for the best proofs, that S. Lauritzen's proof could be 
taken from that book. The proof of our Theorem [3J which contains Theorem [T] as a 
special case, is also rather simple and concise. Similar to S. Lauritzen's proof, it is 
based on an insightful observation (Lemma Q] below) that, in our case, extends the 
well-known result by Pitman on scale independent functions of Gamma distributed 
random variables (see |2Uj). 

The Dirichlet distribution appearing in the limit [A3] is widely used in statis- 
tical applications. These include the modelling of compositional data (see e.g. [1]), 
Bayesian analysis, statistical genetics, and nonparametric inference (see [19] for a 
recent survey on the Dirichlet distribution and its applications). 

The paper is structured as follows. Section [2] contains notation and the main 
results, while their proofs are presented in Section [3J Examples and applications 
to random exchange models, random nested simplices, and a service network with 
polling are presented in Section HJ 

2 Notation and Main Results 

For positive integers r and c, denote by Ai r ,c the set of r x c matrices (pjj)[ =1 J =1 
such that all pij > 0, and by V r , c C Ai r ,c its subclass of matrices with Y2j=iPi,j = ^ 
i = 1, . . . , r. Let -M+ c and P+. be the subclass of all positive matrices from Ai r ,c 
and P r c , respectively (by a positive matrix/vector we mean a matrix/vector with 
all positive entries), and set M + := (0, oo). Clearly, Vd := Vd,d consists of all d x d 
stochastic matrices, with Pj~ := V dd its subclass of positive stochastic matrices 
(we similarly define := M.d,d and M.\ := M.^ d ). For a matrix (Zij) G M. r ,ci 
we denote its row and column sums by Z i9 := YTj=i * = 1> • • • > r ; an d Z*j '■ = 
Y^i=\ Zi,ji j = 1) • • • > c i respectively Similarly, for a vector (Yi, . . . ,Y C ), we denote 
the sum of its components by Y m :— J2i=i Yi- 

For a random element X, we denote its distribution by C(X), and write X ~ F 
if F — C(X). If Y ~ Fy is independent of X ~ Fx, we write Fx <8> Fy for the law 
of (X,Y). 

We denote by T u the Gamma distribution with scale parameter 1 and shape 
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parameter u > 0, with density 

l(0,oo)0)- 



For a positive vector a = (a±, . . . , a<j), we denote by D a the Dirichlet distribution 
on the simplex Sd := with density 

d oi-l 



i=l 



with respect to the (d — l)-dimensional volume measure on Sd- We denote by G a 
the distribution T ai ® ■ ■ • <g) r ad on .Mi^. 

For A = € .M+ c , the Dirichlet distribution Da on P r c is the law of the 

matrix X = (X itj ) G P r>c , such that X w := (X itU . . . , X i>c ) ~ D( a . l7 _ >a . c) and 
are independent. Similarly, by we denote the law of the matrix 
Z = (Z hj ) G M r , c , such that Z« := (Z 4jl , . . . , Z i>c ) ~ G K c) and Z« . . . , ZW 
are independent. 

Let A = G -M^ c . The following theorems are the main results in [6]. They 
are extensions of earlier theorems by Van Assche [21] , who proved them in the case 
c = r = 2 and all a^j = p > (see also [25] for an extension to finite dimensions of 
the result in [24] ) 

Theorem 1. (©) //(l^X) ~ D (ai .,..., Qr .) <8> D A , tfien FX ~ I>(a. 1 ,...,a. c ). 
Theorem 2. (©) If r = c = d,X~ D A> and 

(ai.,... , a rf .) = (a.i, ... ,a. d ), (5) 

£/ien G J^, and X^ ~ ^(ai.,...,a d .)- Furthermore, if Y is a random vector 

taking values in Sd that is independent of X , then YX = Y iffY~ D( aimj _ j(3ldm ). 

Theorem [2] is actually a simple consequence of Theorem [TJ Likewise, our exten- 
sion of Theorem |2] (Theorem H] below) is based on the following result that, as we 
show at the end of this section, implies Theorem [U 

Theorem 3. Let t = (t±, . . . ,t r ) G M.+ and s = (s\, . . . , s c ) G with t, = s.. 
Suppose X is a random element of M. TC independent of both Y ~ D t and V ~ G t . 
Then 

YX~D S iff VX~G S . (6) 

To state the next theorem, we will need the following conditions on the random 
matrix X G Vd- 

[CI] There exists a t G M.+ such that, for a random vector V ~ Gt independent of 
X, one has VX = V 



4 



[C2] For an i.i.d. sequence {X(n)} n >i with X(l) = X, there exists anm<oo such 
that F(X(m, 1) G Vf) > 0. 

Remark 1. Note that, unlike the conditions of Theorem [2] (under which [CI] holds), 
condition [CI] does not mean that X must be positive. In Section HI we provide 
examples where [CI] is met, but most of the entries of X are zeros. 



Theorem 4. (i) Relation 

C(X) G Xi 
holds iff [CI] and [C2] are met for X . 



(7) 



(ii) If (|7|) holds, then X^ ~ D t , where the vector t is the same as in [CI], and if 
Y is a random element of independent of X , then YX = Y iffY = X^\ 

We conclude this section by showing that the assertions of Theorems Q] and [2] 
do follow from those of our Theorems [3] and HJ Let £ = . . . , £<j) ~ Gt for some 
t G R+. Then it is well-known that (see for instance formula 2.1.2 in [T5] ) 



k 



A remarkable characteristic property of the gamma distribution is as follows 
([T7]). Suppose that ?7i , ?7 2 > are independent non-degenerate random variables. 
Then rji and 772 are gamma distributed with a common scale parameter iff rji + 772 is 
independent of 771/(771 + 772). It follows that 



£. is independent of ( . . . , ^ 



k 



and so 
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(£l, • • • j £d) — ( •••,— ) C» 



k 



(9) 



(10) 



where (£1, . . . ,£ d ) is an independent copy of 

We will now use OH]) and relation ffTUl) to show that Theorem [T] follows from 
Theorem EJ Let A = (a id ) G M+ c and (V, Z) ~ G («!.,. ..,a r .) <S> Ga- Then, we have 
from (ED that 



X :-- 



and 



Therefore 



Y :-- 



1 Zi. 



v:" 



z 1>c \ 

Zi. » 



y,x) ~ g 



(ai. 



D A , 



(11) 

(12) 
(13) 
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and 

(Y,X)~D {aim ^ a „, ) ®D A . (14) 

Now 




and relations ( TTUj) and ([TBI imply that the random vector on the right hand side of 
(|T5i) is distributed as 

r 

y^X Z k,l, Zk,c) = (Z.l, • • • , Z. c ) ~ G( Q . lv ..,a. c ). (16) 

fc=l 

It follows from Theorem [3] that for Y satisfying (Til"]) , one has i^X ~ -D(a.i,...,«. c ) ; 
thus establishing the claim of Theorem [TJ 

To see that Theorem [2] follows from Theorems [3] and IU we suppose that {X(n)} n >i 
are i.i.d. with law D A , and A satisfies (EJ). By Theorem PQ if Y ~ ^(ai.,...,a d .) 
is independent of X(l), then yj(l) ~ -0(a.i,...,a. d ), and Theorem |3] implies that 

VX(1) = V for V ~ G (a>1 ,...,«.,) independent of X(l). Thus [CI] holds with 
t = (a.i, . . . , a.d)- Since P(X(1) G Pj") = 1, [C2] is satisfied as well, and the 
assertion of Theorem [2] follows by applying Theorem HJ 



3 Proofs 

A remarkable observation made by Pitman [20] is the following extension of fl9]). 
Let, as above, £ ~ G t , and / : M. d — > M be a scale independent measurable function, 
i.e., for any a ^ 0, 

f(axt, . . .,ax d ) = f(xx, . . . ,x d ). 

Then the random variable • ••,£<*) is independent of 

The proof of Theorem [3] is based on the following extension of that observation 
to random functions. 

Lemma 1. Let (Q, J-", P) be a probability space, (E,£) a measurable space, and 
X : Q E a random element. Suppose H : W x E — >■ R + zs jointly measurable 
and, for any a^O and G f2, 

i7(a?/i, . . . , ay r , X(u)) = H(y u ...,y r , X{u)) 

for all (yi, . . . ,y r ) G R r . If V = (Vi, . . . , V r ) ~ G t , t G R+, independent of X , 
then V, is independent of H(V,X). 

Proof. Let 0(s,w), (s,w) G R+, denote the joint Laplace transform of V, and 
H(V,X). Then 

= ^ e -^+-+Vr)-uH {Vl ,..,V r ,X) 
poo poo 

= C t ■ ■■ vl 1 ' 1 ■ ■ ■ v^e-i^^k^-uHiv!,...^)^ . . . dv ^ 
Jo Jo 
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where C t 1 = Ilfc^^C^)- Making the substitution = (1 + s)vk and observing 
that H(vi, . . . , v r , X) = H(yi, . . . , y r , X) by virtue of scale independence, we have 
that 

/~1 roo /*oo 

0(s,u)= \ / •■• / y t r 1 ---y t r 1 e^Ee~ uH ^-y^dy 1 ---dy r , 
(l + s)^ tfe 7o Jo 

which is the product of two functions, one depending on s, and the other on u. 
Therefore V, and H(V,X) are independent as claimed. □ 

The next result is an obvious consequence of Lemma [TJ 

Corollary 1. Let V = (Vi, . . . ,V r ) ~ G t , and X = (X it j) e Ai r ,c be random 
elements independent of each other, X having positive row sums a.s. Define the 
function H(v\, . . . , v r , X) = (Hi(vi, . . . ,v r , X), . . . , H c (vi, . . . ,v r , X)) by 

H j (v 1 ,...,v r ,X):=J2^ JA > l<J<c (17) 



v, 

i=i 



Then H(V,X) is independent ofV,. 

Proof of Theorem^ Suppose the right relation in (E]) holds, i.e., VX ~ G s for 
V ~ G t independent of X. Then, by Corollary [U for the scale independent func- 
tion H defined in (JI7]L the random vector H(Vx, . . . , V r , X) = . . . , pFj X is 
independent of V,, and therefore 



v^*,...*W(«...,*,,v., 



where (V±, . . . ,V r ) ~ G t is independent of (V, X). Since VX ~ G s , for Z : = 
(Zi, . . . , Z c ) ~ G s , one has 

VX±Z±(%.,...,^)z m , (19) 



(Zi, . . . , Z c ) being an independent copy of Z (cf. (TTUi)). 

Equating the logarithms of the components of the vectors on the right hand sides 
of (HHD and (USD, we obtain that 

In , . . . , In f £*=i^M + l n (y.) ec 



ln(|^y...,lnf^VM^K, (20) 



ZJ' 7 \Z. 

where e c := (1, . . . , 1) G 
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Since t, = s 9 , one has V, — Z m as both follow T tt = T Sm , and so letting ip, (p 
and x denote the characteristic functions of the first, second (and fourth), and third 
terms in f ]2"U]) . respectively, we have 

^Ol, • • • , lt c M«l, • • • , U c ) = X(«l, • • • , UcM«l, • • • ) «c)> 

Noting that 

¥?(ui, . . . , O = Ee' M * ln(K) = — - / e iu ' lTlx x u ~ x e- x dx 

x^-Vdg = r( *V + W) ^ 



nU) J T(t.) 

for t, > 0, we conclude that ip = x an d therefore the respective random vectors 
have a common distribution. Hence one has 

I -4) x- php. • • • ■ ^HP) -{%■■■ ■ I) ^ (21) 

from (jHJ). Observing that the left hand side of (12TI) has the form for Y" ~ D t 
independent of X (cf. jS}), we have that YX ~ D s , and so the left relation in (jBJ) 
holds. 

Conversely, suppose that YX ~ D s for Y ~ .D t independent of X. Using the 
same steps as above, but following them in the reverse order, one can easily conclude 
that VX ~ G s for V ~ Gt independent of X. Theorem [3] is proved. □ 

We will need a simple extension of Proposition 2.2 from [6j. 

Proposition 1. LetX(l) be a random element ofVd satisfying [C2], and {X(n)} n >i 
be an i.i.d. sequence. Then there exists a random element W of Sd such that 

X(n,l)^e T d W (22) 

as n — y oo, where ej denotes the transpose of e&. Furthermore, if Y is a random 
element of Sd that is independent of X(l), then YX(1) = Y iffY— W. 

Proof. By Proposition 2.2 in [6], there exists a random element W of Sd such that 

X(nm, 1) ^> e T d W (23) 

as n — > oo, where m < oo is from [C2]. For k e (nm, {n + l)m) f~l N, one has 

X(k, 1) = X(Jfe, nm + l)X(nm, 1) 

= X(Jfe, nm + l)eJW + nm + 1) (X(nm, 1) - e T d W) 
= e d W + o(l) a.s. 

as n — > oo, since XeJ = ej for any X G Hence, the limit in ( 122]) exists and 
coincides with that in ( 123]) . The second part follows immediately from the proof of 
Proposition 2.2 in [5J. □ 



S 



Proof of Theorem^ (i) Suppose [CI] and [C2] hold. Then, by Proposition (H there 
exists a random element W of Sd such that 

X(n, 1) ^ X = e T d W 

as n — > oo, and therefore [Al] and [A2] hold. From condition [CI] and Theorem 
El it follows that YX(1) = Y for Y ~ D t independent of X{1). By Proposition [U 
the law of Y coincides with the distribution of the limit X^\ i — 1, 2, . . . , d, i.e. 

X(i) = x(2) = . . . = x(d) £ Y ~D t , (24) 

and so [A3] holds. We have proved that C(X) £ J^. 

Conversely, let C{X) £ J(f d . As X(m, 1) converges a.s. to X £ Pj", one must 
have X(m, 1) £ Pj~ for all large enough m, and so [C2] holds. Furthermore, clearly 

X = lim X(n, 1) = lim X(n, 2)X(1) = X'X(1), 

where X' = X is independent of X(l). In particular, one has X«X(1) = XW ~ A 

for some t £ Rj.. Applying Theorem El we have that VX(1) = V for V ~ G t , and 
so [CI] holds. 

(ii) If ([7]) holds, then by (i) we have that [CI] and [C2] are satisfied, in which 
case we have fl24"l) . where t is the same as in [CI]. Furthermore, for Y a random 

element of that is independent of X, one has by Proposition [T] that YX = Y iff 
Y = X^\ where X^ ~ D f by (j2I|). 

□ 

4 Examples and Applications 

4.1 Random exchange models 

Consider the following random exchange model, which is a discrete time analogue of 
certain continuous time Markov processes, called Potlatch models (see for instance 
[TT] and |.l6j). Suppose we have d < oo bins labelled by numbers 1,2, ... ,d, that hold 
amounts qk(n) , k = 1 d, of a homogeneous commodity at times n = 0,1,2, ... , 
respectively. The dynamics of the model is as follows: at time n > 1, the vector 
q{n — 1) := (gi(n — 1), . . . , q&(n — 1)) changes to q{n) := q{n — l)X{n), where 
{X(n)} n >i are i.i.d. random elements of Vd with distribution C(X), which generates 
the Markov chain 

q(n) = q(0)X(l,n), n>l. (25) 

Clearly J2t=i = Ylk=i Qk(®) f° r a ^ n ; anc l without loss of generality, we assume 

that this quantity is equal to one. 

As noted in [6], it is easy to prove that Markov chain fl25l) has a stationary 
distribution when C(X) £ J^. Indeed, defining the random maps K n : Sd —> Sd by 
K n {y) := yX{n), one has that, for any y £ Sd, 

K x o • • • o K n (y) = yX(n, 1) X (1) as n ->■ oo. 
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Applying Proposition 1 in 0, we obtain that C(X^) is the stationary distribution 
of Markov chain f )25|) . 

The random exchange model (I25p is a higher dimension analogue of the stochastic 
give-and-take model studied in [7] and introduced in its deterministic form in the 
context of human genetics in [15]. This model is applicable to the study of dynamical 
systems (see for instance [1]), e.g. it provides a model of a closed economy, where 
agents exchange real wealth at each step in a random manner. 

Now we consider two special cases of random stochastic matrices X with C(X) £ 
J%d- The purpose of these examples is to demonstrate that none of the conditions 
[A1]-[A3] from (6] is necessary. We will also discuss the respective special cases 
of random exchange model fl25|) . Our first example leads to a generalisation of the 
result in (6] to the case of "extended" Dirichlet distributions, where the parameters 
of the distribution are permitted to be zero. Our second example demonstrates that 
the rows of X need not be independent. 

Example 1. For a vector a £ Aii^d, we set D a to be the weak limit of the distri- 
bution D a+e as e j. 0, with the usual interpretation of the sum a + e with a £ M. d 
and e £ IR. In other words, the components of Y ~ D a that correspond to zero 
components of a are identically zero, whereas the subvector of Y consisting of the 
components Yj of that random vector that correspond to aj > form a usual Dirich- 
let distributed vector. Likewise, for a matrix A £ M. r>c , we set Da to be the weak 
limit of the distribution of Da+e as e J, 0. We define the distributions G a and Ga in 
a similar way. 

Let A = (ccjj) £ M. f i with ctj. = a.i > 0, for i = 1, . . . , d. If X 
satisfies [C2], then it turns out that C(X) £ and X^ ~ D 



Da and X 
a d .), and so 



D 



(ai 



id> ) is the stationary distribution of Markov chain (125]) . 

= V for V 



G 



(«i.,...,a d .) 



Indeed, by Theorem H] it suffices to show that VX 
independent of X. To do that, let (V,Z) ~ CVai.,...,^.) ® an d define X E Vd 
by ffTT]) . Then V is independent of X, and ~ GV Ql>i ... jad> ) by f TTS]) and f TTB]) . as 
required. 

In particular, we have obtained the stationary distribution for the following sim- 
ple model, which, to the best of the author's knowledge, has not previously been 
studied: at time n > 1, a uniform proportion of the commodity previously held in 
bin k, k = 1,2, ... ,d, is shifted to the (neighbouring) bin k + 1 (mod d). In this 
case vector q{n) is defined by fl25|) with 



X :-- 



( U x 




\i-u d 



l-U l 
u 2 l-U 2 











\ 





U d _ x l-t/ d _i 

o u d 



(26) 



where the £4, k = l,...,d, are i.i.d. uniformly distributed random variables on 
(0, 1). Observing that X defined by (B6"]) satisfies [C2] for m = d — 1, we have from 
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above that ~ Dp 2), an d so -D(2,...,2) is the stationary distribution of Markov 
chain (1251) with X(n) distributed as (1261) . 

Example 2. In this example, we consider a random stochastic matrix X with all 
rows dependent, which generates Markov chain (1251) with the same stationary distri- 
bution as that corresponding to the random stochastic matrix fl26|) . The behaviour 
of this model is controlled by the decisions of a "leader" as follows. At time n > 1, 
the "leader" shifts a uniform proportion of the commodity held in bin 1 to bin 2. If 
the proportion shifted is greater than 1/2, then no other shifts occur in the system 
at time n. However, if the proportion shifted is less than or equal to 1/2, then 
the commodity previously held in bin k, k = 2,3, ... ,d, d > 2, is shifted to the 
(neighbouring) bin k + 1 (mod d). In this model, the random vector q(n) is given 
by (125]) . with the random element X defined by 



/ U l-U 
/ 1-1 



X :-- 



\ 













1 1- 1 
■ ■01/ 



(27) 



where U is a uniform random variable on (0,1), and / := 1{(/>i/2}, 1a being the 
indicator function for event A. 

We will show that C(X) e Jt d and XM ~ £>(2,...,2)- By Theorem H it suffices to 
show that [CI] and [C2] hold for X, t = (2, . . . , 2) being the vector from [CI]. It is 
not hard to directly verify that X defined by (1271) satisfies [C2] for m = 2d — 2. Now 
let V ~ G(2,...,2) be independent of X, and denote by (p(ui, . . . , Ud) the characteristic 
function of VX. Then, setting u d+ i := u\, we have 



<f(uj 



Eexp \ i( Ul U + u 2 {l - U))Vi +i ^T{ujl + u j+1 {l - I))Vj 



E exp < i Uj+iVj > / 
I i=2 J Jo 

r d 

+ E exp < i UjVj 



i=2 



1/2 



ds 



+ 



i=2 
(l-m 2 ) 2 

nLiC 1- ^*) 2 
a-»i) 2 




[1 - i(ms + u 2 (l - s))]' 



ds 



1/2 [1 - l(«iS + u 2 (l - s))] : 



1 - iu 2 )(2i + ui + M2) 



fc=i 
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which is the characteristic function of V as well, and so we conclude that VX = V. 
Therefore [CI] holds with t — (2, . . . , 2), as required. 

4.2 Random nested simplices 

The study of random triangles and, in particular, nested sequences of random tri- 
angles, has been extensive in the probabilistic literature (see for instance [H] and 
references therein). In one such study [14] . the authors use Theorem 1.2 in [6] (see 
Theorem [2] above) to prove that the limiting distribution of certain random nested 
simplices is Dirichlet. More precisely, they consider a real (d— l)-dimensional affine 
space E and a convex hull of d points p x , . . . ,p d in E which forms a simplex S. 
Choosing d points Pi(l), • • • iPJS) lJ1 $ and taking their convex hull yields a new 
simplex S(l) C S. Similarly, choosing points Pi(2), . . . , p d (2) in S{1) to obtain S(2) 
spanned by them, and continuing in this fashion, we obtain a sequence {S(n)} n >i 
of nested simplices. 

Suppose that, in the above procedure, the points are chosen at random using 
the following mechanism. Let {X(n)} n >i be a sequence of i.i.d. elements of Vd 
with distribution Da, A G Aid satisfying (j5J). Then, for each n = 1,2,... and 
k = l,...,d, the point p k {n) is specified by its barycentric coordinates given by 
X^ k \n) with respect to (Pi(n — 1), . . . ,p d (n — 1)). As shown in [14] , the so defined 
simplices S(n) shrink to a random point Y G E as n — > oo: 

p k (n)^Y, k = l,...,d, (28) 

where the barycentric coordinates of Y with respect to (p 1 , . . . ,p d ) are Dirichlet 
distributed with parameter (cm., . . . , a dt ). We extend this result as follows (see 
Section 2 in [13] for definitions related to affine spaces, affine frames and barycentric 
coordinates): 

Theorem 5. If X(l) satisfies [CI] and [C2], then ( 128|) holds true. Furthermore, 
the barycentric coordinates ofY with respect to (p 1; . . . ,p d ) have distribution Dt, t 
being the vector from [C2]. 

Proof. The proof of Theorem [5] basically repeats that of Theorem 2.2 from |14j . 
except we use Theorem |4] in place of Theorem [2] above. For B(0) G S d , define 
Z{n) G S(n) as the point with barycentric coordinates B(0) with respect to the 
affine frame (p 1 (n), . . . ,p d (n)), and denote by B(n) the barycentric coordinates of 
Z(n) with respect to the affine frame (p 1 , . . . ,p d ). Then, as shown in the proof of 
Theorem 2.2 from [13], one has 

B{n) = B(0)X(n,l), n = 0, 1,... 

Now suppose X(l) satisfies [Cl] and [C2]. By Theorem U C{X{1)) G Xi, with 
X« = (Fi, . . . , Y d ) ~ D t , t being the vector from [C2]. Then clearly B(n) ^ X^ 
as n — > oo for any B(0). In particular, for any k = 1, . . . , d, we can set B(0) : = 
(0, . . . , 0, 1, 0, . . . , 0), where 1 is at the kth place, to obtain that p k (n) Y as n — > 
oo, where Y has barycentric coordinates (Yi, . . . , Y d ) with respect to (p ly . . . ,p d ), 
as required. □ 
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4.3 Service networks with polling 

The well-known story of Buridan's donkey motivated the authors of [22] to consider 
the following model. Let p 1 , . . . ,p d be d > 2 points in the plane. Starting at a 
point R(0) inside the convex hull of (p 1 , . . . ,p d ) at time 0, at time td + r, t > 0, 
r £ {1, . . . , d}, the donkey moves from the point R(td + r — 1) to a point R{td + r) 
which is uniformly distributed on the straight line segment connecting the points 
R{td + r — 1) and p r . As noted in [T3], although it is easy to prove existence of 
the stationary distributions for the d homogeneous Markov chains {R(td + r)} t > , 
r = 1, . . . d, their computation for d > 2 is a difficult problem (the authors of [22] 
focus on the case where d = 2). 

A modification of that scheme in which the donkey travels inside a simplex S 
with vertices p 1 ,...,p d in R d_1 by choosing on step td + r its new position at 
random inside the convex hull spanned by its current location and vectors from the 
set {pi, . . . , Pd}\{p r } was considered in [13]. The main result of [13] establishing the 
form of the stationary distribution of the thus modified donkey walk was actually 
proved for yet another version of the model. Namely, rather than the current position 
of the donkey determining the range for the next step, one instead takes a point with 
barycentric coordinates (with respect to the affine frame (p 1 ,...,p d )) taken from 
rows of i.i.d. random stochastic matrices. 

Specifically, let X := (Xij) be a random element of Vd, and {X(t)} t >o be a 
sequence of i.i.d. random matrices with distribution C(X). The donkey's position 
at time n is specified by the vector B{n) = (Bi(n), . . . , B d {n)) £ S d of its barycentric 
coordinates with respect to the affine frame (p ± , . . . ,p d )- Starting at a non-random 
point -B(O), for r = 1, . . . , d and t = 0, 1, ... , given B(td + r — 1) = (xi, . . . , x d ), 
one has 

B(td + r) = (xi, . . . ,x r -i,0,x r +i, • • • ,Xd) + (x r X rA (t), . . . ,x r Xr td (t)). (29) 

In other words, setting p r := Ylj=i X T j(t)Pp the location of the donkey at time td+r 
will have the same barycentric coordinates in the frame (p ± , . . . ,P r -i,P r ,P r+ i,Pd) 
as those those for the donkey's location at time td + r — 1 in the original frame 
(p 1; . . . , p d ). It is clear that each of the sequences {B(td + r)} t >o, r = 1, . . . , d, 
forms a homogeneous Markov chain. 

This model actually describes the progression of a customer in the following 
discrete-time closed network with polling. Suppose there are d buffer nodes in the 
network, accessed in cyclic order by a single server, with customers never leaving the 
network. At time n — 1, the server accesses node 1 and moves customers from that 
node to other nodes in the network randomly, with transition probabilities taken 
from the first row of a random X(l) £ Vd- At time n < d, the server accesses node 
n, and similarly moves customers from that node to other nodes with transition 
probabilities taken from the nth row of X(l). The same procedure is repeated in 
a cyclical fashion, with the server accessing node 1 at time n = d + 1, and so on, 
with transition probabilities taken from the rows of random elements X(j) £ Vd for 
the jth cycle. In this formulation, the vector B(n) represents the distribution of 
customers in the network after n steps, with initial distribution B(0). 
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Markov chains with the cyclic property that each of the sequences {B(td+r)} t >o, 
r — 1, . . . , d, forms a homogeneous Markov chain, were also considered in [8], where 
several examples of such Markov chains were provided. 

The following assertion was proved in [T3] (instead of the stated Theorem 3 
concerning the "true donkey walk" discussed at the beginning of this section). 

Theorem 6. ([IS]) Let A = (a i:j ) G M.\ satisfy ©, and X ~ D A . Then, for r = 
1, . . . , d, the stationary distribution of the rth homogeneous Markov chain {B(td + 
r)}t>o given by ( |29|) is Dirichlet with parameters ((3 r ,i, ■ ■ ■ , Pr,d) defined by 

Pr,j ■= 1 s^d+r , ■ ^ (30) 

with the convention that ccjj = aiij> when j = j' (mod d) . 
For a matrix X = (X^j) G Aid and r — 1, 2, . . . , d, set 

(T r (X)).. := (Sij + X r j5 ri - SrjSij), 1 < i,j < d, 

Sij being Kroneker's delta (i.e., T r (X) is the d x d identity matrix with the rth row 
replaced by the rth row of X). 

An extension of Theorem O is given by the following theorem. 

Theorem 7. Let X be a random element ofVd such that T±(X) ■ ■ - Td(X) satisfies 
[C2]. Let V = {Vij) ~ Gb be a random matrix independent of X , where B G KA\ 
has entries (3 it j given by (130 p . Furthermore, suppose that 

y{r) £ v^TriX), l<r<d, (31) 

where V® := V® . 

Then, forr = 1, . . . , d, the stationary distribution of the rth homogeneous Markov 
chain {B(td + r)} t >o is Dirichlet with parameters ((3 r ,i, ■ ■ ■ , f3 r ,d)- 

Proof. Let {X(t)} t >o be i.i.d. with law C(X), and fix r G {1, . . . , d}. Then 

T l (X(t))---T d (X(t)) iorr = d, 

T r+1 (X(t)) ■ ■■T d (X(t))T 1 (X(t)) ■ ■ -T r (X(t)) for r < d, 

t = 0, 1, . . ., are i.i.d. random elements in Vd, and 

B(td + r) = B(r)R(l,t), 

where R{l,t) is given by ©. Clearly, R(l) satisfies [C2] since T X (X) ■ ■ -T d (X) 
satisfies [C2], and so by Proposition [fl there exists a random vector W G S d such 
that 

R(t, 1) ^ e T d W 

as t — > oo. Using Proposition 1 in [5], applied to the space S d and to the random 
mappings from S d to Sd defined by x h-> xZ(t), t > 0, it follows that {B(td + r)} t > 
has a unique stationary distribution. 

Now V {r) = V {r) R(l) by ([31]), and therefore Theorem 1 implies that -R (1) ~ 
D(p r ^_£ rd ). Hence the stationary distribution of {B(td + r)} t > is -D(/3 rl ,...,/3 rd ) as 
required. □ 



R(t) 
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The following remark shows that our Theorem [7| is indeed an extension of The- 
orem [6j 

Remark 2. Let A = {oc^j) G Aid be such that «j. = ct,i > for % — 1, . . . , d, and 
B = (Pij) G M.^, where the numbers are given by (IHUj) . Let (V, Z) ~ G B ® Ga, 
and X ~ Da defined by <^3) be such that T X (X^) ■ ■ ■ T d (X^) satisfies [C2]. Then, 
for r = 1, . . . , d and using the convention that r — 1 = d for r = 1, one has 



where the equality in distribution holds by fTTOj) and the fact that V r 

both follow Tp r _ lr = T arm . It remains to observe that /3 r>r = a r>r and, for j 7^ r, 

A-j = A-ij + &r,j, hence the vector in the last line above is distributed as V^. 

The conditions of Theorem [7] are met, and we conclude that, for all r — 1, . . . , d, 
the homogeneous Markov chain {B(td+r)} t > has stationary distribution D(g r l fi r d ) 
In particular, for X defined by (|26|) . one has (3 r j = 1 for j 7^ r + 1 (mod d), and 
(3 r> j = 2 for j = r + 1 (mod d). 
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